Query-Topic Focused Web Pages Summarization

نویسندگان

  • Seung Yeol Yoo
  • Achim G. Hoffmann
چکیده

We present a novel Web Pages Summarizer ContextSummarizer that subgroups the given Web pages into ‘sense-clusters’ respecting a user’s topic interests, and constructs a dynamic extractive summary for each sense-cluster. A user’s topic interest is described by the user who selects and refines some of word senses disambiguated within the content contexts of the given Web pages. The semantic similarity measures between the contents of Web pages/segments/sentences and the user-selected/-refined word senses were used to choose the most topically relevant sentences as the extractive summaries referring to a user’s topic interest. As the results, it addressed the dynamic semanticalignment issues between the content of a Web page and the user’s topic interest about that Web page, and between the user’s topic interest and an extractive summary. Some case studies and experimental results showed that query-topic focused extractive summaries returns more topically consistent sentences for an extractive summary.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Exploiting relevance, coverage, and novelty for query-focused multi-document summarization

Summarization plays an increasingly important role with the exponential document growth on the Web. Specifically, for query-focused summarization, there exist three challenges: (1) how to retrieve query relevant sentences; (2) how to concisely cover the main aspects (i.e., topics) in the document; and (3) how to balance these two requests. Specially for the issue relevance, many traditional sum...

متن کامل

Ontology and Query-Focused Multi-Document Summarization System

Due to the increasing growth of online information on the specific topic, Multiple Document Summarization (MDS) has become a non-trivial task. The MDS facilitates the user to understand the large volume of information in a short time by creating a concise and comprehensive summary. In addition, user’s query based MDS system provides a consistent summary, including the core of the information. T...

متن کامل

Design of a Focused Crawler Based on Dynamic Computation of Topic Specific Weight Table

Abstract Focused Crawler aims to select relevant web pages from internet. These pages are relevant to some predefined topics. Previous focused crawlers have a problem of not keeping track of user interest and goals .The topic weight table is calculated only once statically and that is less sensitive to potential changes in environment. To address this problem we design a focused crawler based o...

متن کامل

Topic-Sensitive Hidden-Web Crawling

A constantly growing amount of high-quality information is stored in pages coming from the Hidden Web. Such pages are accessible only through a query interface that a Hidden-Web site provides and may span a variety of topics. In order to provide centralized access to the Hidden Web, previous works have focused on query generation techniques that aim at downloading all content of a given Hidden ...

متن کامل

Query-focused Multi-Document Summarization: Combining a Topic Model with Graph-based Semi-supervised Learning

Graph-based learning algorithms have been shown to be an effective approach for query-focused multi-document summarization (MDS). In this paper, we extend the standard graph ranking algorithm by proposing a two-layer (i.e. sentence layer and topic layer) graph-based semi-supervised learning approach based on topic modeling techniques. Experimental results on TAC datasets show that by considerin...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006